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This  documentation  is  from  the  information  file  of  a 
computer  program,  SID,  that  illustrates  the  concept  of  stochastic 
indexing.  This  information  file  is  not  intended  to  be  a  stand¬ 
alone  report,  but  to  be  used  with  SID  to  quickly  understand  a 
concept  that  is  simple  for  a  progreunmer  to  implement  but  profound 
to  a  statistician  in  its  application. 

Purpose 

The  demo  program,  SID  (Stochastic  Indexing  Demo) ,  in  listing 
1  simulates  clouds  as  seen  by  a  satellite  looking  straight  down. 
As  a  pictorial  rendering  of  clouds,  it  is  rather  poor  since  it 
only  uses  circles  to  indicate  where  clouds  are.  But  with  a 
little  imagination,  one  can  see  structures  that  resemble  some 
types  of  clouds.  By  varying  the  random  number  seed,  the  number  of 
clouds,  puffs,  rows,  etc.,  a  wide  variety  of  patterns  can  be 
generated. 

However,  cloud  depiction  is  not  its  purpose.  Its  purpose  is 
to  demonstrate  a  new  method  of  "storing"  and  retrieving  randomly 
generated  features.  One  hundred  cloud  clusters  each  with  up  to 
30  puffs  (an  average  of  17)  are  generated  for  a  total  of  about 
1700  puffs.  The  location  and  size  of  each  of  these  puffs  can  be 
retrieved  very  fast.  The  computer  memory  storage  required  to 
"store"  this  information  is  ZERO! ! 
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Furthermore,  the  number  of  puffs  could  have  just  as  easily 
been  10  million  with  additional  information  on  liquid  water 
content,  vertical  velocity,  etc.  etc. ,  and  the  required  storage 
still  would  be  zero.  The  method  is  not  limited  to  clouds.  The 
same  method  could  be  used  for  simulated  trees  in  a  forest, 
simulated  fish  in  the  sea,  or  simulated  accounts  in  a  bank. 

The  trick  is  to  reseed  the  random  number  generator  using  the 
feature's  index  -  in  this  case  the  cloud's  identification  ntimber, 
1  to  100.  Some  backgroxind  on  random  number  generators  may  help 
understanding . 

PandQTii  Number  Generators 

Computer  random  number  generators  start  with  some  initial 
number,  operate  on  this  ntimber,  then  return  the  result  as  a 
"random  number".  Often  used  is  the  simple  congruential  generator: 

New_random_number  (Old_random_number  *  VI  +  V2)  mod  V3 
where  the  constants  VI,  V2,  and  V3  are  carefully  chosen  to 
emulate  randomness  and  keep  the  series  of  numbers  as  large  as 
possible  before  the  same  number  repeats.  Thus,  these  numbers  are 
not  random  at  all  but  have  a  definite  sequence  to  them.  [Extra 
Credit  Brain  Teaser:  Given  three  consecutive  random  numbers  from 
a  simple  congruential  generator,  how  can  VI,  V2,  and  V3  be 
calculated?] 

Some  computers  use  the  video  refresh  frame  number  as  a 
01d_random_number  to  obtain  varied  sequences.  I  once  worked  on  a 
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program  that  used  this  method  only  to  keep  getting  the  same 
random  sequence.  It  turned  out  that  at  the  command  RUN,  the  video 
refresh  frame  nxunber  was  reset  to  zero  and  the  seoiie 
01d_random_number  was  always  obtained.  Another  case  I  read  about 
used  an  analog  to  digital  converter  hooked  up  to  a  Geiger  counter 
reading  background  random  radiation  to  obtain  a  "true"  random 
number.  The  only  troiable  was  that  the  random  numbers  had  a 
distinct  period  in  them  that  coincided  with  a  nearby  revolving 
radar. 

In  any  case,  stochastic  indexing  makes  use  of  these 
reproducible  sequences  to  rapidly  regenerate  specific  information 
about  each  indexed  feature  such  as  location,  size,  etc. 

The  Demo  Program 

With  regard  to  the  demo  progreun  itself,  even  though  it  is 
written  in  SuperBasic,  programmers  (FORTRAN,  BASIC,  etc.)  will 
understand  most  lines.  But  a  few  sections  need  some  additional 
explanation. 

The  most  important  part  is  the  procedure  Ith_Cloud.  It  is 
used  to  initially  define  each  cloud's  parameters  when  called  from 
line  230  and  is  also  used  to  retrieve  the  ith  cloud's  pareuneters 
when  called  from  line  340. 
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Ith  Cloud  Spacer 


In  Ith_Cloud,  line  470  sets  the  random  number  seed  for  the 
ith  cloud.  The  99  times  the  index  is  a  spacer  so  that  the 
starting  number  for  each  cloud  is  99  items  further  along  in  the 
random  number  secpaence.  Without  the  99,  the  y  location  of  the 
first  cloud  would  be  the  x  location  of  the  second  cloud 
(sometimes!).  A  spacer  larger  than  the  largest  possible  niunber  of 
parameters  for  each  feature  keeps  the  seuae  sequences  from  being 
reused.  Sometimes  too  small  a  spacer  causes  no  noticeedsle 
effect;  other  times,  some  weird  things  happen. 

Acceptance/Rei ect ion 

The  algorithm  as  stated  so  far  will  give  a  uniformly  random 
distribution  of  clouds.  That  is,  any  point  is  just  as  likely  to 
have  a  cloud  as  any  other.  There  are  many  simulation  algorithms 
to  convert  uniform  distributions  to  some  desired  pattern.  The  one 
used  here  is  an  acceptance/ rejection  method  that  allows 
stochastic  indexing  to  produce  varied  patterns. 

Lines  480  to  510  contain  an  acceptance/ rejection  algorithm 
which  is  pretty  standard  in  simulation.  It  works  like  this:  A 
function  is  defined  over  the  area  that  is  close  to  1  where  you 
want  most  clouds,  close  to  zero  where  you  want  few  clouds,  and 
negative  where  you  want  no  clouds.  The  function  chosen  (feel  free 
to  chose  your  own)  here  is  COS(2*PI*Nrows*(x-y) )  where  Nrows 
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determines  the  number  of  cloud  rows  and  (x-y)  locates  them  in 
lower-left  to  upper-right  orientation.  The  Cosine  function 
varies  from  1  to  -1  as  x-y  varies. 

Next  a  potential  site,  x  and  y,  is  selected.  The  function  at  x 
and  y  (-1  to  1  in  this  case)  is  compared  to  a  random  number  (0  to 
1) .  If  the  function  is  larger  then  the  random  number,  the  site  is 
selected;  but  if  the  random  nximber  is  larger,  then  the  site  is 
rejected  and  a  new  pair  of  x  and  y  is  selected.  This  continues 
(REPeat  accept_reject)  until  a  site  is  selected.  Do  you  see  why 
sites  with  a  function  near  one  will  have  more  clouds? 

Any  function  can  be  used,  but  make  sure  it  is  positive  at 
least  in  some  areas  or  the  REPeat  loop  will  never  be  exited.  Also 
functions  with  small  areas  of  positive  values  and  areas  with  most 
values  near  zero  will  run  very  long  since  most  sites  will  be 
rejected  most  of  the  time. 

Speed.  Networking,  and  Portability 

Random  number  generators  are  generally  coded  to  be  very 
fast.  In  many  cases,  a  random  number  can  be  generated  as  fast  or 
faster  than  a  number  can  be  fetched  from  an  array  in  RAM, 
particularly  a  multi-dimensional  array.  If  the  array  must  be  sent 
over  a  LAN  (Local  Area  Network)  or  WAN  (Wide  Area  Network)  for 
distributed  simulation,  then  stochastic  indexing  is  much  faster. 

On  the  other  hand,  random  nxunber  generators  are  often 
specific  to  a  particular  computer.  If  all  computers  on  the  net 
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are  the  same,  then  there  is  no  problem.  But  if  they  are 
different,  the  random  number  sequences  can  be  different, 
are  such  things  as  portable  random  number  generators,  but 
say,  that  is  another  story. 
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LISTING  1 


100  REMark  SID  Stochastic  Indexing  Demo  by  A. Boehm  30  Aug  1993 
110  REMark  *******  set  limits.  Try  varying  these.  ******* 
t  120  Seed-898  :REMark  defines  a  new  random  set.  Try  varying 

it. 

130  Total_clouds*100  : REMark  Number  of  clouds, 
k  140  max_puffs=30  : REMark  Max  Number  of  puffs  for  each  cloud 

150  max_i_size=lo  : REMark  max  size  of  cluster,  closeness  of  puffs 
160  max_puff_size=2  : REMark  max  size  of  puff  in  cluster 
170  Nrows-4  :  REMark  Number  of  rows  per  unit  (height  of 

window)  distance 
180  : 

190  WINDOW  512,246,0,0;WINDOW#2,512,10,0,246:PAPER#2,0:INK#2,5 
200  REMark  ************  make  cloud  scape  ************ 

210  MODE  8: PAPER  ItlNK  7:CLS 
220  FOR  i=l  TO  Total_clouds 

230  Ith_cloud  i: REMark  sets  x,y,isize,npuffs_at_i 

240  FOR  j=l  TO  npuffs_at_i 

250  CIRCLE  x+RND* isize , y+RND* isize , RND*max  puff  size 

260  END  FOR  j 

270  END  FOR  i 

280  REMark  *******  find  cloud  i,  puff  j  ******** 

290  REPeat  find 

300  CLS#2:PRINT#2,  'Find  Which  cloud?  Enter  l  to 

' ;Total_clouds, ;INPUT#2,  i 

310  IF  i<l  OR  i>Total_clouds  THEN  BEEP  2000,100:GO  TO  300 
320  PRINT#2,  'Cloud  ';i, 'Which  puff?  Enter  1  to 

' ; npuf f s_at_i ; : INPUT#2 ,  j 

330  IF  j<l  OR  j>npuffs_at_i  THEN  BEEP  2000,100:GO  TO  320 

340  Ith_Cloud  i:REHark  sets  x,y, isize, npuffs_at_i 

350  FOR  k=l  TO  j-l:a=RND:b=RND:c=RND 

360  x=  x+RND*isize;y=y+RND*isize:r=RND*max_puff_size 

370  PRINT  #2, 'Push  s  to  stop  or  another  key  to  go  on.'; 

380  REPeat  blink 

390  INK  0: CIRCLE  x,y,r 

400  INK  7: CIRCLE  x,y,r 

410  a$=INKEY$;IF  a$<>"  THEN  EXIT  blink 

420  END  REPeat  blink 

430  IF  a$=*'S'  THEN  STOP 

440  END  REPeat  find 

450  : 

460  DEFine  PROCedure  Ith_Cloud  (i) 

470  RANDOMISE  i*99+Seed; REMark  sets  new  seed  as  a  function  of  i 
480  REPeat  accept_r eject 

490  X=RND:y=RND 

500  d=COS(2*PI*Nrows*(x-y))  ;IF  d>RND  THEN  EXIT  accept_re j ect 

t  510  END  REPeat  accept_re j ect 

520  x=x*162 :y=y*100:REMark  scale  to  fill  screen 

530  isize=RND*max_i_size 

,  540  npuf fs_at_i=RND(5  TO  max_puffs) 

550  END  DEFine  Ith  Cloud 
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